Studies of an Event Building algorithm of the readout system for the twin TPCs in HFRS* 
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The High Energy Fragment Separator (HFRS), which is currently under construction, is a leading inter- 
national radioactive beam device. Multiple sets of position-sensitive Twin Time Projection Chamber (TPC) 
detectors are distributed on HFRS for particle identification and beam monitoring. The twin TPCs’ readout 
electronics system operates in a trigger-less mode due to its high counting rate, leading to a challenge of han- 
dling large amounts of data. To address this problem, we introduced an event-building algorithm. This algorithm 
employs a hierarchical processing strategy to compress data during transmission and aggregation. In addition, 
it reconstructs twin TPCs’ events online and stores only the reconstructed particle information, which signifi- 
cantly reduces the burden on data transmission and storage resources. Simulation studies demonstrated that the 
algorithm accurately matches twin TPCs’ events and reduces more than 98% of the data volume at a counting 


rate of 500 kHz/channel. 
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I. INTRODUCTION 


With the ongoing development of superconducting heavy- 
ion accelerator technology, developed countries are proposing 
the construction of next-generation (third-generation) large 
scientific devices to obtain higher-intensity and higher-energy 
radioactive nuclear beams. They aimed to expand their re- 
search into heavier and closer-to-the-drip-line nuclide ar- 
eas [1-3]. Currently, facilities such as BigRIPS [4] at RIKEN 
in Japan and ARIS [5] at MSU in the United States have been 
completed and are in operation, whereas others such as Su- 
perFRS [6] at GSI in Germany and the High Energy Fragment 
Separator (HFRS) [7—10] at the Institute of Modern Physics, 
Chinese Academy of Sciences, are under construction. Once 
completed, the HFRS will be a new generation of radioactive 
beam devices with higher beam energy and magnetic rigidity 
(Bp) and will significantly enhance the experimental capa- 
bility in the medium-heavy nuclear region. Given the high- 
energy and high-intensity properties of HFRS, it is necessary 
to implement a highly reliable identification of nuclides at 
high counting rates (10 MHz) and meet the large dynamic 
range requirements (Z = 1 ~ 92), which poses a great chal- 
lenge to radioactive particle detection and readout electronics 
technology. The Time Projection Chamber (TPC) [11-14] 
is a highly efficient and high-resolution detector for tracking 
particles that allows for the reconstruction of particle motion, 
momentum, and energy loss. It plays a crucial role in par- 
ticle physics research and has been widely used in nuclear 
physics experiments such as the ALICE [15-17] Large Col- 
lision Experiment at CERN, the RHIC-STAR [18, 19] ex- 
periment at Brookhaven National Laboratory, and the CSR 
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External-Target Experiment (CEE) [20, 21] currently under 
construction in China. Owing to its high reliability, the TPC 
has been chosen as a position-sensitive detector for particle 
identification and beam monitoring. However, as the count- 
ing rate of the projectiles increases, there is a strong possi- 
bility that the readout signals from different particles expe- 
rience temporal confusion. For instance, when the counting 
rate is 10 MHz, the drift distance is 5 cm, and the drift ve- 
locity is 5 cm/us, particles are incident on the detectors per 
100 ns, and the maximum drift time is 1 us. Under such cir- 
cumstances, the probability of post-incident particle sensing 
signals on the readout strips before the pre-incident particle 
is as high as 90%. To address this issue, two TPC detec- 
tors with identical configurations placed in close proximity 
are inverted to form a set of twin TPCs [22], as illustrated 
in Fig. 1. When the same beam particle passes through the 
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Fig. 1. (Color online)I[llustration of the twin TPCs. (a) Schematic 
diagram of two beam particles incident on the twin TPCs; (b) An 
arrangement of TPC detectors placed in an inverted position. 


twin TPCs, the electrons generated by ionization eventually 
produce induced signals at the anode readout strips, which 
carry energy and time information. Once the readout elec- 
tronics acquire and process the induction signal, the hit’ posi- 
tion of the incident particle and the drift time of the electrons 
in the two TPCs are available for further analysis. In exper- 
iments with low counting rates such as GET (~1 kHz) [23] 
and medium counting rates such as CEE (~10 kHz) [24, 25], 
readout electronics employ the trigger-based method. The 


server cluster completes data processing tasks, such as trans- 
mission, event building, and storage. However, as the count- 
ing rate increases, the trigger-based method may no longer 
meet experimental demands. Therefore, a trigger-less read- 
out scheme [26, 27] is proposed. For example, ATLAS (~40 
MHz total collision event rate) [28] and ALICE (~50 kHz) 
[16] upgrade projects have adopted the trigger-less readout 
system, allowing the analysis and filtering of massive data in 
the back-end. Compared with the conventional trigger-based 
mode, the trigger-less system collects all the raw data from 
the front-end electronics, which improves the efficiency of 
obtaining valid events during the experiment, and the trigger 
algorithms are realized by the software, thus providing greater 
flexibility. However, trigger-less systems require significantly 
more bandwidth for transmission and processing, as well as 
increased computing power. These requirements increase de- 
velopment difficulty and overhead for readout systems. To 
overcome the bottleneck of data transmission and storage in 
trigger-less mode, and accurately match events from the twin 
TPCs, we propose an online algorithm for event building. Our 
algorithm employs a combination of software and hardware 
to handle massive amounts of data using a hierarchical pro- 
cessing strategy, and addresses the issue of particle time dis- 
order at high counting rates. Specifically, data are compressed 
in each electronics readout unit and only key information is 
extracted, which significantly reduces the burden of trans- 
mission and storage. In addition, by utilizing information 
on the particle hit position and drift time obtained from the 
twin TPCs, we can effectively screen and reconstruct events, 
thereby recovering the original position information of the in- 
coming particles. Consequently, our readout system exhibits 
a high counting rate and low data volume. 


I. THE HIERARCHICAL DATA PROCESSING 
STRATEGY 


The readout requirements for twin TPC are listed in Ta- 
ble 1. Even if the front-end electronics performs zero com- 
pression [29, 30], a high counting rate places enormous pres- 
sure on the data acquisition system. Therefore, it is imper- 
ative to explore a hierarchical processing strategy in each 
readout unit to reduce transmission and storage pressures. 
The readout system for the twin TPCs employs a three-tiered 
structural design consisting of front-end electronics (FEEs), 
slave data acquisition units (slave DAQs), and a server, which 
facilitate data processing at the channel, detector, and subsys- 
tem levels, respectively. The schematic in Fig. 2 illustrates 
the hierarchy of the data processing system. The master data 
acquisition unit (master DAQ) of HFRS collects data from 
each subsystem server for downstream processing and stor- 
age. Because our work focuses on the subsystem, we do not 
discuss the master DAQ in detail in this paper. 

The main function of the FEE is to process the sig- 
nals from the detectors at the channel level. The 
FEE components include front-end amplifiers for MWDC 
(FEAMS) [31, 32], which have been used in CEE, multichan- 
nel analog-to-digital converters (ADCs) (ADS52J90) [33] 


TABLE 1. Twin TPC’s readout requirements. 


Item Value 
Maximum total counting rate of the detector 10 MHz 
Particle multiplicity 5 

Number of readout channels 500 
Single-channel average counting rate ~100 kHz 
Single-channel peak counting rate ~500 kHz 


and a field-programmable gate array (FPGA) (Xilinx Kintex- 
7-325T) [34]. A FEAM chip can handle eight channels over a 
wide dynamic range (~100 times). Each channel comprises a 
charge-sensitive amplifier (CSA), a shaper circuit, and a non- 
inverting driver circuit. The shaper circuit performs pole—zero 
cancellation, filtering, and shaping. The FEAM chip first am- 
plifies and shapes the weak signals from the readout strips, 
and then passes the output signal to the ADC. The chip count- 
ing rate is currently being optimized for HFRS-TPC require- 
ments. The design employs sub-board and mainboard con- 
figurations to facilitate readout capabilities for 32 or 64 chan- 
nels. The sub-board comprises four FEAM chips, whereas the 
main board can house two sub-boards simultaneously. The 
FPGA firmware features an online algorithm for extracting 
time and energy information to minimize data volume. 

The FPGA firmware of the FEE contains a self-triggering, 
data packaging, baseline subtraction, zero-compression, and 
a feature extraction modules. These modules cooperate to 
implement the initial data compression. The self-triggering 
module creates trigger signals for the data packing module 
at predetermined intervals, allowing the packing module to 
segment an uninterrupted input data stream based on these 
signals. Upon arrival of each trigger signal, a time window 
was initiated with a duration equal to the time interval be- 
tween the two triggers. The relevant data within the time 
window were packaged in a particular format. To minimize 
the overhead caused by packet headers, a time window of 
1000 ADC sampling intervals was utilized in this study. The 
counter of the system clock cycles is included in the packet 
header as a coarse timestamp supplied uniformly by the clock 
unit and calibrated at regular intervals. Subsequently, raw 
data are transmitted in packets. The baseline subtraction 
module adjusts the baseline by dynamically calculating the 
baseline noise using the currently available data. The zero- 
compression module discards invalid data by judging the va- 
lidity of the pulse waveform and then sends valid pulses to the 
feature extraction module to obtain high-precision pulse time 
and energy information. Finally, the information for each 
pulse is repackaged and sent to the slave DAQ. 

The slave DAQ executes the aggregation of detector- 
level data using a high-performance FPGA (Xilinx Kintex- 
UltraSCALE 060) [35], which aggregates and packages data 
from multiple FEEs with 10Gbps high-speed optical fiber 
links. In the trigger-less mode, the slave DAQ needs to sort 
the data in chronological order and channel order, which 
is convenient for downstream event screening and building. 
This was accomplished by developing a polling and sorting 
modules within the FPGA firmware of the slave DAQ. The 
polling module systematically extracts data from eight or 16 
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Fig. 2. (Color online)Schematic diagram of the data processing system hierarchy. 


FEEs, unpacks the data packets, and recovers the original 
pulse information. The pulse information is then transmit- 
ted to the sorting module, which arranges and orders all the 
channel pulses using the time information. Finally, the slave 
DAQ packages the sequenced data and transmits them to the 
server through a PCIe interface. 


The server plays a crucial role in issuing slow control in- 
structions for the electronics system and reconstructing sub- 
system events, and the foremost step of event building in- 
volves using a time window to merge multiplicity signals. 
Once this was completed, the center of gravity (CG) approach 
was used to determine the original incident positions of the 
particles on the readout strips in each detector. Finally, both 
the incident position and drift time from the two TPCs were 
fed into the hit-matching module to reconstruct the particle 
tracks. The amount of data saved was significantly minimized 
by removing invalid data. 


HI. DESIGN AND IMPLEMENTATION OF EVENT 
BUILDING ALGORITHM 


There are two types of interpretation of event building [36]. 
In brief, event building involves executing various software 
procedures on data, while, in broader sense, event building 
is determined by the system architecture of the software and 
hardware collaboration. Specifically, multilevel processing, 
including data packaging, feature extraction, and aggregation, 
is first performed on the FPGAs of each readout electron- 
ics unit. Then, software is leveraged to reconstruct physical 
events. Based on the hierarchical processing strategy outlined 
in Section II, the entire flow of the event-building algorithm 
includes signal preprocessing (adaptive baseline subtraction 
and zero compression), feature extraction (timing and energy 
extraction), multiplicity screening (fine timestamp sorting, 
fine timestamp merging, and CG for hitting position), and hit 
matching. Figure 3 illustrates the detailed algorithm pipeline 
and Fig. 4 displays the data format at each stage. A detailed 
description of the implementation of each module is provided 
below. 


A. Signal pre-processing 


To accommodate the discrepancies across channels, the 
channel-wise baseline and noise must be calculated within 
each packet. The baseline was determined by averaging the 
first 10 sample points of each channel, and the noise was cal- 
culated using the root-mean-square value of these points. In 
the selection process, if any sample point exceeds the mini- 
mum signal level required for electronics processing, the op- 
eration is stopped, and the region is deemed an effective sig- 
nal segment. The baseline and noise were recalculated by 
moving back 20 points (adjusted for the pulse width). Con- 
sidering both the baseline and noise, the threshold was mea- 
sured as the sum of the baseline and four times the noise. 
If the sample point exceeded the threshold, the threshold was 
subtracted, whereas a sample point that was less than or equal 
to the threshold was replaced with zero. 

Only signals with more than three consecutive sample 
points above the threshold were considered as valid pulse sig- 
nals. The packet payload records the total quantity of these 
overthreshold sample points (Clust_L), the serial number of 
the first overthreshold points in the packet (Clust_T), and the 
corresponding data (D). Multiplying Clust_T by the ADC 
sampling interval (50 ns) gives the offset time of the pulse 
relative to the coarse timestamp in the packet header. Fol- 
lowing zero compression, the total quantity of data (ChLen) 
in each channel should include all pulse data points and the 
corresponding descriptive information (Clust_L and Clust_T). 
This approach significantly reduced the amount of data. 


B. Feature extraction 


When operating in trigger-based mode, event screening is 
based on trigger signals; while in trigger-less mode, it relies 
on high-precision time information. Therefore, implement- 
ing a timing module is essential for enhancing the accuracy 
of the inferred Clust_T. The constant fraction discriminator 
(CFD) [37] divides the original signal into two paths: one 
delays the signal for a certain time, whereas the other inverts 
and weakens the signal. By merging these two paths, constant 
fraction timing is converted into zero-crossing timing, thereby 
effectively mitigating the effects of time wandering. Because 
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Fig. 3. (Color online)Workflow of the event building algorithm. 


(a) Raw Package 
Head FLAG P PKT BDadd TCode Len 


Chn3 | Chtens [D | pv | v P]... fo} 
py---- py 


Chn4 | ChLen4 [ D I D b An... 


FLAG—the flag of the packet Header 
P—the parity bit 

PKT— the packet type 

Bdadd—the address ID of the FEEs 


Tcode—the Coarse timestamp 
Len—the length of packet 
CRC—the CRC checksum 
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Fig. 4. (Color online)Data packet format at each stage. 


the ADC operates based on discrete sampling, the overthresh- 
old time derived from CFD 1s often imprecise. Therefore, two 
data points were located before and after the overthreshold 
time (Va and V»), and a linear equation or interpolation tech- 
nique was applied to achieve a more precise time measure- 
ment. During the initial validation of the software simulation, 
we derived a linear equation for the amplitude and time of 
the superimposed signals based on V, and Vp: The exact mo- 
ment of overthresholding corresponds to when the amplitude 
is zero. In the firmware, we use an interpolation method to 
approximate the threshold, which follows the principle of di- 
chotomy [38]. We first calculated the mid-amplitude point 
(Vinia) based on V, and V, by dividing the original time inter- 
val into two equal sub-regions (a and b). The original time 


interval is equivalent to the ADC sampling interval. If Vmia 
multiplied by Vz is negative, we can determine that the over- 
threshold moment is in sub-region b; otherwise, it is in the 
other sub-region. Therefore, we reduced the time interval 
corresponding to the overthreshold moment to half that of the 
original. By repeating this process n times, we can increase 
the timing accuracy by a factor of 2” compared to the original. 
Finally, the timing outcome was superimposed on Clust_T to 
provide a more precise pulse time. Clust_T is a 16-bit vari- 
able, and its high 10 bits are used to record the serial number 
of the data points within the packet (ranging from 0 to 1000) 
as the offset time relative to the coarse timestamp. The re- 
maining six bits are utilized to record high-precision timing 
outcomes, which subdivides 50ns into 64 equal parts, thereby 
improving the time resolution. 

Events can accumulate at high counting rates. The peak 
method can better distinguish overlapping signals compared 
to the area integration method. When several adjacent sample 
points first increase and then decrease, the peak (Clust_E) is 
at the vertex. After the energy extraction, only Clust_T and 
Clust_E were recorded for each pulse. We used the ChLen 
field to denote the pulse count for each channel to facilitate 
subsequent sorting. 


C. Multiplicity screen 


When particles enter the detector, there is usually an angle 
between the incoming direction and detector plane. Trans- 
verse diffusion occurs after the particle incidence, enabling 
multiple readout strips to produce induction signals. The sig- 
nals have consistent timestamps, but different levels of mag- 
nitude across multiple readout channels, which is called parti- 
cle multiplicity [22] as shown in Fig. 5(a). Thus, a multiplic- 
ity screen was required to recover the original hit position of 
the particle. When merging the particle multiplicity, the first 
step is to arrange the pulses of all channels within the packet 
in chronological and channel order. Because all the pulses 
in a packet share the same coarse timestamp (TCode), they 
only need to be sorted according to Clust_T, which is known 
as fine timestamp sorting. Figure 5(b) shows the workflow of 
fine timestamp sorting. Since the pulse data of each channel 
are stored in separate containers (FIFOs or arrays) in chrono- 
logical order, only the first pulse data in each container were 
considered for comparison. Subsequently, the pulse data cor- 
responding to the smallest moment obtained from this com- 
parison were moved from the old container to the new con- 
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Fig. 5. (Color online)Schematic diagram of (a) Particle multiplicity,(b) Fine timestamp sorting, and (c) Fine timestamp merging. 


tainer. In the second round, the second pulse data from the 
old container were compared with the first pulse data from 
the other containers. This process continues until all data in 
each container are compared and stored in the new container. 
If the pulse times in the two containers are equal, they are 
arranged in the channel order. If the time interval between 
the adjacent channel’s pulse signals is shorter than the des- 
ignated time window, they are categorized as the same event 
and can be merged according to the channel order, as illus- 
trated in Fig. 5(c). The Clust_T value was determined based 
on the fine timestamp of the central channel, and a count field 
was added to record the total number of pulses in each event. 
This information is then used in the CG to identify the hit’s 
position of the incident particle. 

Using CG (in Eq. (1)), the hit’s position (Xpit) of each par- 
ticle on the readout strip was determined based on the pre- 
viously obtained channel-fired ID and energy value. The 
channel-fired ID was used as the x; coordinate, and the en- 
ergy value acted as the weight /;. The drift time of the elec- 
trons is measured as the sum of the coarse timestamp and 
fine timestamp. Both the hit’s position and drift time from 
the twin TPCs were relayed to the hit-matching module, and 
the two-dimensional trace of the incoming particles was ulti- 
mately reconstructed. This process also helps discard invalid 
or irrelevant data. 


Xn = X i Ei/ XE; 


(1) 


D. Hit matching 


In Eq. (2), the drift distance (L) and drift speed (Varift) of 
electrons for twin TPCs are predetermined. As a result, the 
sum of the drift time (tes) remains fixed and can be used 
as a constraint to achieve hit-matching [22] of twin TPCs. 
Although the plastic scintillator detector (Zo) can provide 
a highly accurate reference time (to), a large fluctuation in 
the particle flight time can negatively impact its accuracy. 


Moreover, to information is transferred directly to the master 
DAQ, and system-wise event building is achieved by combin- 
ing the data from the subsystem servers, which may reduce 
timeliness. Considering the aforementioned factors, we pro- 
pose a novel hit-matching algorithm for reconstructing parti- 
cle tracks without to information. This allows event building 
to be performed on the subsystem server, thus enabling addi- 
tional data compression and easing the computational burden 
on the master DAQ. 


tes = tu + ta — 2to = 


(2) 
Udrift 
where tu, ta is the absolute drift time of electrons from the 
twin TPCs and tọ is the reference time from detector Tọ. 
The matching algorithm can be classified into two methods 
based on whether the factor tọ is incorporated: the relative 
time method and the absolute time method. 


1) the relative time method 


(a) Search for all hits from twin TPCs in the range 
[to — 50+, to + tamax + 50+] according to to. 


(b) Time matching: For the searched hits, use their 
drift times and tọ to calculate tes, then select the 
hit combinations within 50s. 


(c) Position matching: For hits that meet the time 
matching conditions, choose the combination 
with the smallest hit distance. 

(d) Positioning: According to the drift time of the 
chosen hit, combined with the drift distance and 
drift speed, calculate the hit position in the Y di- 
rection. 


2) the absolute time method 


(a) Search for all all hits from the second TPC in the 
range [tu —tamax — 601, ty +tamax +60;] according 
to tu. 


(b) Time matching: For the searched hits, calculate 
jt — ta| and select the hit combinations within 
120 te 


(c) Position matching: For hits that meet the time 
matching conditions, choose the combination 
with the smallest hit distance and minimize the 
energy difference. 


(d) Positioning: According to the drift time of the 
chosen hit, combined with the drift distance and 
drift speed, calculate the hit position in the Y di- 
rection. 


Otas Z V 20:2 + 4o? (3) 


where 0+, Oto, Ctes are the time resolutions of the TPC detec- 
tor, Tọ detector, and tes variable, respectively. 


IV. VERIFICATION AND ANALYSIS 


As the development of TPC and front-end electronics is 
underway, we were unable to test the algorithm on the cor- 
responding devices. Alternatively, we first conducted a sim- 
ulation with software to verify the performance of the pro- 
posed event-building algorithm and then performed prelimi- 
nary testing with existing laboratory equipment. 


A. Simulation study 


We use the Monte Carlo method [39] to generate the twin 
TPCs’ simulation packet. When particles enter the twin 
TPCs, electrons generated by ionization drift up and down, 
with the drift time based on their hits’ position in the Y di- 
rection, the total drift distance remains unchanged. The fired 
readout channels were determined by the hits’ position on the 
readout strips, incidence angle, and transverse diffusion, with 
a maximum of five channels. Therefore, the drift time and 
fired channels can be set according to the length and width 
of the TPCs’ incidence cross section. The FEAM chip ex- 
hibited a peak time of 160 ns and a falling edge at approxi- 
mately 320 ns. As a result, the CR — (RC)? characteristic 
equation is employed to derive accurate pulse data, and the 
signal amplitude range is defined by the selected ADC bit 
width. When the system’s counting rate reaches 1 MHz, a 
particle is expected to hit the twin TPCs every 1000 ns on av- 
erage. Therefore, two sets of simulation waveform data were 
simultaneously produced every 1000 ns ( to ). The amplitude, 
fired channels, and drift time (with tọ as the reference) for 
each waveform were randomly generated to mimic a real sce- 
nario accurately. Baseline noise obtained from existing FEE 
measurements was added to the simulation data to enhance 
authenticity. In addition, simulation data were produced for 
each TPC using only one FEE consisting of 32 channels. 

Figure 6 shows the overall distribution of the simulation 
data for a pair of complete packets for the twin TPCs. Mul- 
tiple simulation waveforms were created within a 50us pe- 


riod, traversing 32 different channels with varying peak am- 
plitudes and covering most of the ADC range (bit width of 
10). Figure 7(a) shows the simulated waveforms of the twin 
TPCs’ central channel fired at a specific time, which ex- 
hibit high similarity with only minor amplitude variations. 
The phase relationship between the two drift times covers 
the three cases of overrun, approach, and lag that satisfy the 
simulation requirements. Furthermore, Figure 7(b) shows 
the simulated waveforms after removing the baseline, which 
dropped to zero. The two TPC packets contained 219 and 
210 pulses, respectively. In other words, an average of 6~7 
pulses were generated on each readout channel in 50us time 
for both TPCs, so the average counting rate greater than 100 
kHz/channel as expected, indicating that the simulation data 
closely resembles the actual situation. 


Fig. 6. (Color online)Distribution of (a) The Ist TPC’ simulation 
data and (b) The 2nd TPC’ simulation data. 
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Fig. 7. (Color online) Simulated waveforms of the twin TPCs’ cen- 
tral channel fired. (a) Raw simulated waveforms; (b) Waveforms 
after subtracting the baseline. 


After merging the multiplicities, the first TPC collected 50 
sets of event data and the second TPC recorded 49 sets of 
event data. Subsequently, the hits’ positions and drift times 
of all event data are transmitted to the hit-matching module, 
where the absolute time method is employed for matching. 
49 data sets were successfully matched, and only one event 
dataset was discarded due to matching failure. Finally, based 
on the matched drift time pairs, the drift distances were cal- 
culated, and the particle hit positions in the Y -direction were 


recovered, as depicted in Fig. 8. When the incident particle 
strikes the twin TPCs, three drift distances are possible: the 
first TPC’s drift distance may be greater or less than that of the 
second TPC, or the two drift distances may be equal; however, 
the sum of the drift distances in the two TPCs is always the 
maximum drift distance. Figure 8 shows three typical cases: 
(a) when the incident particle collides with the center of the 
twin TPCs and the two drift distances are equal; (b) when 
the particle impacts a position close to the first TPC’s read- 
out strip, the second TPC experiences the maximum drift dis- 
tance; and (c) if the hit is offset to the second TPC’s readout 
strip, the first TPC undergoes a larger drift distance than the 
second TPC. Theoretically, the particle should hit the same 
position on the readout strips of both TPCs. However, in 
practice, a slight deviation between the two hits’ positions 
is possible due to oblique incidence. Using the relative-time 
approach, the same datasets produced consistent results, as 
shown in Fig. 8. 
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Fig. 8. (Color online)Matched results obtained using the absolute 
time method from the twin TPCs’ simulation packets. (Two different 
symbols of the same color represent the hits’ positions of the same 
incident particle in two TPCs, respectively.) 


To verify the performance of event-building algorithm at a 
high counting rate, the hit probability of the readout channel 
was adjusted from a uniform to Gaussian distribution. Fur- 
thermore, the probability of the readout channels in the cen- 
tral region being fired increases. As illustrated in Fig. 9(a), 
four to five channels within the two TPC packets were fired 
more than 25 times in 50us, indicating a counting rate of 500 
kHz/channel. Three sets of packets were analyzed and the 
matching results obtained using both methods were consis- 
tent, as shown in Fig. 9(b). It was observed that both match- 
ing algorithms could effectively recover the two-dimensional 
traces of incident particles under normal conditions. How- 
ever, the relative time method is more suitable for severe cases 
because it employs a more rigorous equation as a constraint. 
Therefore, the absolute time method can be utilized in the 
subsystem server to initially match hits, remove inconsistent 
data, and further reduce data volume. In addition, the relative- 
time technique can be employed in the master DAQ to accom- 
plish system-wise hit identification. 

As an illustration, the current simulation parameters, that 
is, a single data packet containing 32 channels with 1000 sam- 
ple points each and an additional 16-bit descriptive informa- 
tion as the raw data volume were used to evaluate the data 
compression capability for the event-building algorithm. The 
evaluation was based on the following assumptions. 
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Fig. 9. (Color online) (a) Number of fired for each channel in the 
twin TPCs; (b) Particles hit positions in the Y -direction obtained by 
hit matching in the 1st TPC: The red symbol represents the results 
acquired by the relative time method, while the blue symbol shows 
the outcomes obtained using the absolute time method. 


1) A total of 50 particles are incident in 50 microseconds 
and each particle hits 5 readout channels, generating a total of 
250 valid pulses. 

2) After removing the baseline, an average of 15 sampling 
points were retained for each valid pulse. 

3) The fine timestamps and energy information were each 
16 bits. 

4) The position information for the hits was 10 bits in both 
the X (readout strip) and Y directions. 

The compression ratios for several stages are listed in Ta- 
ble 2, which indicates that the event-building algorithm can 
shrink 99% of the raw data and 97% of the zero-compressed 
data. 


B. Test Verification 


As shown in Fig. 10, a primary test system was set up in 
the laboratory using existing electronic devices. Note that 
the chassis of the remote server and slave DAQ contained 
within it are not visible in this figure. A signal generator 
(keysight33522B) [40] was employed to create a trigger sig- 
nal and two-pulse signals. The two pulse signals were fed into 
the two FEAM chip’s analog inputs on the FEE to emulate the 
signals from two TPCs, respectively. The input signals were 
first converted into digital form by the ADC, and then trans- 
mitted using an optical fiber to the slave DAQ for aggregation 
and packaging. Finally, the data were sent to a remote server 


TABLE 2. The compression ratios for several major processing phases. 


Data-processing phase 
Raw data 
Zero compression 
Timing and energy extraction 
CG for hitting position 
Hit matching 


via a PCle interface. The drift time of the electrons generated 
by the primary ionizing particles was simulated by adjusting 
the delay time of the input signal relative to the trigger sig- 
nal. Multiple pulse signals collected by the FEE were then 
spliced together to mimic a trigger-less mechanism. With a 
sampling frequency of 50 MHz, the ADC collects 100 sam- 
ples per pulse signal, implying a pulse period of 2 us and a 
counting rate of 500 kHz/channel. Figure | 1(a) illustrates the 
initial waveforms of the two FEE channels’ output at specific 
times. Next, the adaptive baseline subtraction module ana- 
lyzes and removes any baselines from the data. The outcomes 
of this procedure are shown in Fig. 11(b). The results indicate 
the effectiveness of the module in cleaning the baseline noise. 
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Fig. 11. (Color online) (a) Original waveforms; (b) Waveforms after 
subtracting the baseline. 


To verify the performance of the event-building algorithm 
using experimental data, we selected eight typical delay time 
pairs and five different amplitude combinations of input sig- 
nals to conduct 40 tests, each consisting of eight pulses. The 
additional delay caused by the readout system must be sub- 
tracted before data processing. After processing, 320 test 
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Fig. 12. (Color online)Particles hit positions in the Y -direction ob- 
tained by hit matching in the 1st TPC: The red symbol represents the 
results acquired by the relative time method, while the blue symbol 
shows the outcomes obtained using the absolute time method. 


pulses were sent to the hit matching module, all of which 
were accurately matched using either absolute or relative time 
methods. The Y-direction positions where the particles hit 
were determined using 320 pairs of drift times. From this 
dataset, 20% of the data were extracted and plotted in Fig. 
12, which shows that the results of the two matching algo- 
rithms were highly consistent. The key test parameters and 
compression ratios are shown in Table 3. The preliminary ev- 
idence shows that the event-building algorithm can effectively 
compress data. 


TABLE 3. The key parameters and compression ratios 


Item Value 

Counting rate ~500 kHz/channel 
Number of channels l 

Pulse width (ns) ~500 

ADC sampling rate (MH2) 50 

The raw packet payload 8016 bits (100%) 


The packet payload after zero compression 2192 bits (~28%) 
The packet payload after hit-matching 160 bits (~2%) 


V. CONCLUSION 


This paper presents an event building algorithm that aims 
to tackle two major challenges: the transmission and storage 
of large amounts of data generated by the trigger-less read- 
out system, as well as the issue of particle time disorder at 
high counting rates. The algorithm conducts hierarchical pro- 
cessing and compression of data on FEEs, slave DAQs, and 
a server platform, and finally matches the twin TPCs’ events 


on the server to reconstruct the two-dimensional traces of in- 
cident particles. The processing flow of the algorithm was 
fully implemented by software. The data generated from both 
the simulation and preliminary laboratory tests were accu- 
rately matched using the algorithm, and the results suggested 
that it could effectively reduce the data volume by at least 
98%. Potential future directions for improvement include im- 
plementing firmware logic for timing, energy extraction, and 
sorting by utilizing FPGAs on both FEEs and slave DAQs. 
This leverages the pipeline parallelism and low latency of the 
FPGA to enhance the overall system performance. In addi- 
tion, the use of GPU multicore parallel computing [41, 42] to 
improve the speed of multiplicity merging and hit matching 
modules is also a direction for future research. 
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